Domain adaptation for text dependent speaker verification

نویسندگان

  • Hagai Aronowitz
  • Asaf Rendel
چکیده

Recently we have investigated the use of state-of-the-art textdependent speaker verification algorithms for user authentication and obtained satisfactory results mainly by using a fair amount of text-dependent development data from the target domain. In this work we investigate the ability to build high accuracy text-dependent systems using no data at all from the target domain. Instead of using target domain data, we use resources such as TIMIT, Switchboard, and NIST data. We introduce several techniques addressing both lexical mismatch and channel mismatch. These techniques include synthesizing a universal background model according to lexical content, automatic filtering of irrelevant phonetic content, exploiting information in residual supervectors (usually discarded in the i-vector framework), and inter dataset variability modeling. These techniques reduce verification error significantly, and also improve accuracy when target domain data is available.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Unsupervised learning of HMM topology for text-dependent speaker verification

Usually, text-dependent speaker verification can achieve better performance than text-independent system because of the constraint that the enrollment and testing utterance share the same phonetic content. However, the enrollment data for text-dependent system usually is very limited. Expectation Maximization(EM) training of HMM will suffer from noisy estimation because of limited enrollment. A...

متن کامل

Model adaptation methods for speaker verification

Model adaptation methods for a text-dependent speaker verification system are evaluated in this paper. The speaker verification system uses a discriminant model and a statistical model to represent each enrolled speaker. These modeling approaches consist of a neural tree network and Ganssian mixture model. Adaptation methods are evaluated for both modeling approaches. We show that the overall s...

متن کامل

Comparison of background normalization methods for text-independent speaker verification

This paper compares two approaches to background model representation for a text-independent speaker verification task using Gaussian mixture models. We compare speaker-dependent background speaker sets to the use of a universal, speaker-independent background model (UBM). For the UBM, we describe how Bayesian adaptation can be used to derive claimant speaker models, providing a structure leadi...

متن کامل

Unsupervised intra-speaker variability compensation based on Gestalt and model adaptation in speaker verification with telephone speech

In this paper an unsupervised compensation method based on Gestalt, ISVC, is proposed to address the problem of limited enrolling data and noise robustness in text-dependent speaker verification (SV). Reductions in EER and in the integral below the ROC curve as high as 20% or 40% and 30% or 60%, respectively, can be achieved by ISVC independently of the number of enrolling utterances. In contra...

متن کامل

On comparing and combining intra-speaker variability compensation and unsupervised model adaptation in speaker verification

In this paper an unsupervised intra-speaker variability compensation method, ISVC, and unsupervised model adaptation are tested to address the problem of limited enrolling data in text-dependent speaker verification. In contrast to model adaptation methods, ISVC is memoryless with respect to previous verification attempts. As shown here, unsupervised model adaptation can lead to substantial imp...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014